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Abstract. We modify (classical) jump-robust estimators of integrated real- 
ized covajriance to estimate the Fourier coefficients of the instantaneous sto- 
chastic covariance. By means of a central limit theorem for those Fourier 
coefficient estimators we are then able to prove consistency and a (pointwise) 
central limit theorem for the non-parametrically reconstructed instantaneous 
covariance process itself. The procedure is — by methods of Fourier analysis 
— robust enough to allow for an iteration and wc can therefore show theoret- 
ically and empirically how to estimate the integrated realized covariance of 
the instantaneous stochastic covariance process. We also explain a surprising 
shrinkage phenomenon for the constructed Fourier estimators, i.e., in compar- 
ison to classical (local) estimators of instantaneous variance the asymptotic 
estimator variance of the Fourier estimator is smaller by a factor 2/3, but an 
additional (pointwise small) bias appears. We apply these techniques to robust 
calibration problems for multivariate modeling in finance, i.e. the selection of a 
pricing measure by using time series and derivatives' price information simul- 
taneously. "Robust" here means that re-calibration is more stable over time, 
that the estimation procedures of, e.g., instantaneous covariance also work in 
the presence of jumps, and that the procedures are as robust as possible with 
respect to input deficiencies. 



1. Robust Calibration 

The recent difficulties in the banking and insurance industry are to some extent 
due to insufficient modefing of multivariate stochastic phenomena which appear in 
financial markets. There are several reasons why modeling is insufficient, but the 
two most important ones arc the following: first, realistic multivariate models are 
difficult to calibrate to market information due to a lack of analytic tractability, 
hence oversimplified models are in use in delicate multivariate situations, and, sec- 
ond, usually either time series data or derivatives' prices are used to select a model 
from a given model class but not both sorts of available information simultaneously. 
We do not speak here about phenomena which should not be considered stochas- 
tic and where stochastic modeling is inappropriate at all, but we concentrate on 
multivariate time series of liquid instruments, where appropriate and easy to im- 
plement stochastic modeling is needed. Applications are risk management, pricing 
and hedging of instruments, and portfolio management. 
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We aim to develop methods which allow for robust calibration, i.e., estimation 
and calibration of a model in a well specified sense simultaneously from time series 
and derivatives' prices data in order to select a pricing measure. It is useful to 
demonstrate what we actually mean with robust calibration by means of an example: 
take a Heston model with possibly state-dependent jumps in the log-price process 
and variance 



where X is the stochastic variance process and Y the log-price of a stock. and 
L"^ denote the possibly state-dependent jump parts. Through robust calibration we 
have to identify the initial value Xq, Yq, the five parameters /i, n, 9, a and the cor- 
relation between the Brownian motions p, and the compensators of the jump parts 
in order to specify the model for purposes of pricing, hedging or risk management. 

Apparently at least the initial values Xq, Ko, and the parameters a and p do not 
change under equivalent measure changes, so in principal the parameters Xq,Yq, 
(7, p can be identified from the observation of a single trajectory. On the other hand 
market implied values for those parameters should coincide with vahics estimated 
from the time series if the model is close to correct. Here "market implied values" 
means to choose model parameter values such that the model's derivatives' prices 
and the market prices coincide as well as possible. From basic theory of mathe- 
matical finance we know that the calculation of derivatives' prices has to be done 
under an equivalent martingale measure for the process exp(F), but Xq, Vo, f , p do 
not change under equivalent measure changes. 

Formally speaking we have defined the above model on a filtered probability 
space (O, J',P) and we consider equivalent measures Q ~ P. Having specified a 
set of parameters and a semimartingale depending on parameters G 0, we 
can then define an equivalence relationship, namely Q\ ~ Qi if S'f^P = •S'^f^Q, i.e., 
equality for the respective measures on the canonical probability space of cadlag 
paths, for some Q ^ P. This equivalence relation defines orbits on 9 and the 
space of orbits 9/ ~, where the latter set is the set of invariant parameters, i.e., 
those parameters of 5^, which are not touched by equivalent measure changes. For 
sufficiently rich parametric models this set is usually not trivial. 

Having this basic stochastic fact in mind, it should be clear that non-parametric 
estimation of instantaneous covariance processes is the important task to be per- 
formed, since it yields in the previous concrete case information on cr, p and 
the trajectory of instantaneous stochastic variance t ^ Xt along the observation 
interval. The parameters k and Q, the characteristics of the jumps will rather 
be calibrated from derivatives' prices (except the Blumenthal-Getoor index), since 
they cannot be identified from time series information. The difference between the 
martingale measure identified through calibration and the real world measure is 
most delicate to estimate from the time series information: we do not comment on 
this issue here. 

To be precise on time series data: we think of intraday price data for liquid 
instruments along periods of months up to years, such that we have about 10^ data 
points available without loosing assumptions on time-homogeneity. 

We call a calibration procedure robust, if it follows the subsequent steps: 
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(i) estimate parameters which are invariant under equivalent measure changes 
from time series by econometric means together with respective confidence 
sets. 

(ii) cahbrate the partiahy specified model via a non-linear (robust) pricing 
operator to market prices and determine the rest of the parameters. Here 
we do neither assume one model anymore, nor do we expect the pricing 
operator to provide one price, compare to the recent work ^3]. 

(iii) reject the model if the second step calibration is producing time- inconsistent 
or time-dependent results when monitored along a time series of deriva- 
tives' prices. 

There is also another reason why time series methods should enter the field of 
multivariate model selection in mathematical finance. In fact, due to the high di- 
mensional parameter space and the lack of liquidly traded options whose payoffs 
depend on several underlyings, calibration is particularly challenging. This diffi- 
culty can be tackled to a certain extent by additionally using time series of asset 
prices, from which - according to the above calibration principle - those parameters 
which remain invariant under equivalent measure changes can be determined. 

Based on this concept, the goal of the present article is to find methods which 
allow to estimate non-parametrically the instantaneous stochastic variance of the 
instantaneous stochastic variance process X, and, to estimate instantaneous sto- 
chastic correlation between the log-price process Y and the stochastic variance 
process X. This involves a two step procedure where we first need to recover the 
realized path of the instantaneous variance, from which we can then - by iteration 
of the estimator - estimate the second order quantities. 

In the literature the problem of identification of a pricing measure under time 
series information is usually solved by setting up high-dimensional filtering prob- 
lems. From the filtering point of view our method can be understood as choosing 
- by direct non-parametric estimation techniques - excellent priors concentrated 
around invariant parameters in order to initialize the filtering procedure. 

2. A REMARK ON PATHWISE ESTIMATION PROCEDURES 

Robust calibration refers to splitting the task of model identification into an 
estimation and a calibration subtask. This leads to the necessity of finding pathwise 
estimation procedures for, e.g., instantaneous covariance processes, where we look 
for robustness rather in the statistical sense. More precisely the presence of non- 
equidistant and non-equal grids for different log-price processes should not influence 
the procedures, i.e., we have to encode market information into quantities which 
depend in continuous ways on the observations. This is the topic of this second 
introductory section, where we argue why to apply Fourier estimation techniques 
as outlined, e.g., in [9j. We are aware that more classical estimation methods 
for instantaneous covariance could also be applied for our purposes, but we try to 
argue that Fourier techniques might appear more robust in the light of the following 
arguments. 

We consider multivariate stochastic models to describe the joint stochastic evo- 
lution of several price processes for purposes of pricing, hedging and forecasting. 
Such models typically incorporate 

• a stochastic covariance process, yielding stochastic volatility of each single 
asset and stochastic correlation between the different assets, and 
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• jumps, whose jump measures can exhibit a rich dependence structure. 

We assume that we discretely observe a path of the log-price process with mod- 
erately high frequency (up to minutes) and we would like to estimate from this 
path stochastic covariance as well as possible. Since the problem is multivariate we 
should also care about the following two problems: 

• the observation grid might be not equidistant. 

• observation grids of different prices might differ. 

This amounts to develop a theory which has an additional robustness, or in other 
words, it amounts to use estimators where the gridding plays a minor role. Both 
aspects are discussed in this introductory section from an basis expansion point of 
view. 

The estimation of invariant parameters from discrete observations of a path of 
a scmimartingalc y on a finite interval [0,T] is related to the question how to 
represent this information in a robust way, which in this context simply means as 
continuous as possible. We would like to take a particular point of view on this 
question, which allows to unify several approaches present in the literature. Con- 
sider L^i[0,T]) the Hilbert space of complex valued, square-integrable functions on 
the interval [0, T] and a finite orthonormal system (ei, . . . , e„) in it. The orthogo- 
nal projection of an element / G L^([0,T]) onto the span of (ei, . . . ,e„) can then 
be represented as 7r„(/) = J2^=i {f^^i)^ hence the calculation of the coordinates 
{f,ei), for z = 1, . . . ,n is key if we want to calculate the projection 7r„(/). The 
coordinates arc continuous functions of the full paths with respect to the L^- and 
even to the L°°-topology, but continuity is more delicate an issue when it comes to 
coefficients of the "increment" process, i.e., when we aim to represent the derivative 
of/ 

'^n{f)=y^ei / ei{s)df{s). 
i=l -^0 

Here apparently continuity of the coordinates is not generically true (since, e.g., not 
every element / € L^([0,T]) will be differentiable), but remains true if integration 
by parts works, for instance if Cj G C^, for i = 1, . . . , n. The representation of the 
derivative /' is important in our setting since this corresponds to the representation 
of the increments of a stochastic process Y. 

Let us assume now that the path of the semimartingale Y is observed on time 
points = < < ... < t"!^ = T: this set of time points can be encoded in 
the ("discrete observation") orthonormal system = — ^p=^l[(r,_^ jn] for i = 

1, . . . ,n. Then the discrete observation (i.e., observation together with their time 
points of observation) is bijectively encoded through the following sum of stochastic 
integrals 

i=l i=l "^0 

if tacitly Yq := is assiimcd. Since many estimators will be formulated as func- 
tions of increments, we have to understand the previous expression's continuity 
properties. In particular this means that proper encoding in the case of discrete 
observations should rather be formulated in terms of stochastic integrals with re- 
spect to the path than in terms of integrals along the path. However, stochastic 
integrals are known to be rather measurable than continuous for generic integrands 
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on path spaces. Only when the basis lunctions satisly some differentiability prop- 
erties allowing for integration by parts, i.e., for e G C"'^([0,r]) 

e{s)dY, = YTe{T) - / Y,e'{s)ds , 
Jo 

the coordinates ei{s)dYs are continuous with respect to the path on path space. 
It might therefore be wise 

• to formulate the theory in a coordinate-free way, i.e., estimators can have 
inputs coming from different orthonormal systems. 

• to allow a change of coordinates for the estimated quantities, i.e., when 
the input is given on a discrete grid (which corresponds to a basis decom- 
position with respect to (e")i<i<„), then output could be produced with 
respect to any other reasonable orthonormal system. 

We shall not fully follow this program, since we do start with discrete observa- 
tions, i.e., coordinates with respect to (£")i<i<„i but we shall formulate the theory 
such that we can estimate consistently, together with a central limit theorem, co- 
ordinates {ei,X) for i = 1, . . . ,n with respect to any chosen orthonormal system 
in L^([0,T]). In [8] additionally all estimators are written in terms of Fourier- 
coordinates (e^, X) for i = 1, . . . , n, which has several procedural advantages with 
respect to non-equidistant or non-equal gridding. We insist that we could formulate 
our theoretical findings also in this spirit, but in this article we start by writing 
estimators functional on increments. The main idea now is the following: classical 
estimators of the form 

n 

F"(x)^^(c-tr-i).9 
t=i 

will play a major role, where - under continuity assumptions on g - continuous 
dependence on discrete observations is clear. On refining grids we shall often obtain 
consistency, i.e., 

V^{X)^ f Pg{Xs)dS 

Jo 

for some function pg in a consistent way, possibly together with a central limit 
theorem. The announced "change of coordinates" is then performed through the 
following sequence of estimators 

n 

(2.1) VJ'{X)=J2{t2-t7-i)e,{f2)g 

1=1 

for j = 1, . . . , A^, which converge to 

/ ej{s)pg{Xs)ds, 
Jo 

for j = 1, . . . , iV in a consistent way together with a central limit theorem. This new 
interpretation of estimators for integrated quantities allows to obtain estimators 
for coordinates of the spot quantity with respect to any other orthogonal system 

i^i)i=l,...,N- 

In view of robustness, which means continuity here, we choose a coordinate 
system whose coordinates depend as continuous as possible on the observation 
path t Pg{Xt) in order to smoothen input deficiencies. This advantage will be 
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important for the estimation of quantities related to X as we shall do it at the 
end of this work. In particular we believe that the change of coordinates from 
the orthogonal system (ef )i<i<7v another, possibly more smoothing system like 
the Fourier basis, {ei)^^^ jq, can be of advantage when it comes to quality of 
approximation. Notice also that V-^ with respect to the basis (£i^)i<j<]v (where 
the basis elements are given on a coarser grid corresponding to N) would correspond 
just to the well-known local realized variance estimators on sliding, non-overlapping 
windows. 

Having constructed estimators for coordinates with respect to a basis (e;),-^j^ ^y, 
we know the projection of the path on the space spanned by (ej)^_-^ j^, but since 
every coordinate comes with an estimation error, we have to understand how those 
errors influence the rcconstriiction. This is the very reason why we introduced 
another cardinality N < n oi the (reconstruction) orthonormal system. The choice 
of N amounts to understanding the size of sums with random error coefficients £i 

N 
i=l 

which is small in the Z/^-sense on [0, T] and on the observation probability space if 
Ei=i H^i] is small. The size of this sum determining the L -error will depend on 
the variance of and on the number N. We see here that - even though we would 
like to take N <n as large as possible - the accuracy of reconstruction is bounded 
by the size of the previously described error, which grows with A^. 

If we ask for the L°°-error in reconstructing the path we should rather consider 
t J2iLi ei{t)ai£i, where a, are an additional weighting sequence optimizing the 
L°°-approximation (e.g., Fejer weights in case of the Fourier basis). This leads to 
an error random variable with expectation 

N 
i=l 

and second moment 

N 

ei{t)ej{t)aiajE[£i£j] , 

for t G [0,T]. Assuming for a moment that the errors for the estimation of different 
coordinates are i.i.d. with vanishing expectation, then the estimator variance equals 

N 

Var[fi] ^(e,(t))'a2, 

i=l 

for t G [0, T]. This error quantity depends on the particular choice of the orthonor- 
mal system and the structure of the errors; its size might differ considerably. In 
particular very localized orthonormal systems rather lead to truncated sums of er- 
rors, such that "diversification" effects do not take place. We therefore believe that 
orthonormal systems like the Fourier basis might bring better results in some cases. 

Another aspect for the choice of orthonormal systems besides continuity of rep- 
resentation of path information and control of L°° -errors is related to locality of 
the estimator: when we ask for an estimated value of Pg{Xf) at one point in time 
t, we can use in principle all the information along the observed path to infer that 
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value. How information is weighted as a function of time distance to t is strongly 
influenced by the choice of the basis. Local estimators, like most of the classical 
estimators, use a comparably small sliding window to estimate the instantaneous 
quantity pg{Xt), where other, non-local estimators use exponential weights, or al- 
gebraically decreasing weights of information along the path. 

The Fourier-Fejer estimator, which we introduce in the sequel, satisfies all out- 
lined requirements: we have continuity on path space of functionals K i— >■ J ei{s) dYg. 
Error control might lead to "diversification effects" , since sums are never truncated, 
and the estimators are non-local, i.e., the information of the whole time series enters 
into estimators of instantaneous covariance. We therefore expect that the estimator 
variance of Fourier-Fejer estimators might be smaller than for estimators written 
with respect to the basis (£f^)i<i<jv' which is indeed the case as proved in Theorem 



6.6 see also Remark 6.7 On the other hand an additional bias will (has to) appear, 
which, however, is pointwise quite small. 

This shrinkage phenomenon is confirmed by Figure [l] below, which shows a com- 
parison between the classical local realized variance estimator and the Fourier-Fejer 
estimator. In particular, the variance of the Fourier-Fejer estimator is comparable 
with the one of a James-Stein shrinkage variant of the classical estimator. 

In our illustration example the underlying semimartingale y is a drifted Brow- 
nian motion with constant variance, that is, 

dYt = bfdt + VxdZt, 

where denotes the drift, Z a standard Brownian motion and X the determin- 
istic constant variance, which we aim to measure on a coarser grid given discrete 
observations of Y. The Fourier-Fejer estimator X"'^ is described in detail in the 



following sections and given by (6.2) (here applied with g(y) = and T ~ 1). As 



we shall prove in the sequel it is asymptotically normal N{X, ^^^^) as n, — )• oo 
and lim — K for some constant K and 7 > 1. In comparison, the classical 
estimator is given by 



with ef(t) = 1 „ „ (t) if = i for J = l,...,N and = ^ for 



i = 1, . . . ,n, which corresponds to 



where VJ^{X) is given in (2.1 1 with ej = ef and g{y) — y^. According to [SI 

Theorem 13.3.3], Xf is asymptotically normal J^{X,2X^^) as n, N ^ 00 and 
lim = K for some constant K and 7 > 1. In particular, the variance of the 
Fourier-Fejer estimator is | of the one of the classical estimator. A similar variance 
reduction phenomenon can be achieved by applying the James-Stein estimator to 
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-n,N 



and considering the following shrinkage estimator: 
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where M denotes the number of evaluation points of X 
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Figure 1. Comparison of estimators 

The remainder of the article is organized as follows. In Section [3] we introduce 
the assumptions on the log-price and the instantaneous covariance process and 
Section [4] gives an overview of the different steps in our estimation procedure. 
Section Is] is dedicated to jump robust estimators for the Fourier coefficients, while 
in Section [6] consistency and a central limit theorem are proved for the Fourier- 
Fejer instantanous covariance estimator. In Section [7] we consider estimators for 
the integrated covariance of the instantaneous covariance process and Section [8] 
concludes with simulation results. 



3. Setting and assumptions 

Throughout we work on a filtered probability space (f^, J^, (-7^t)o<t<T, P), where 
we consider a d-dimensional (discounted) asset price process (5'()o<t<Tj which is 
supposed to be nonnegative, and adapted to the filtration {Tt)o<t<T ■ Due to the 
non-negativity of S we further assume 

St = (exp(rta), . . . , eMYud)V, 0<t<T, 

where (l'f)o<t<T denotes the c?-dimensional (discounted) logarithmic price process 
starting at Fg = j/ G E'' a.s. Due to no-arbitrage consideration S and thus also Y 
are supposed to be semimartingales with a rich structure of jumps. 
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Furthermore, let us introduce some mild structural assumptions on the log-price 
process Y, namely that it is an Ito-semimartingale of the following form: 

Assumption (H). The logarithmic price process Y satisfies 

Yt=y+ f bids + f ^/xlldZ, + f [ xmi^^{dt ds) - K,{dS,)ds) 
Jo Jo Jm."^ 



(3.1) 





t 



{^~x{W{di,ds), 



where y/X denotes the unique matrix square root on S'j' , the space of positive semi- 
definite matrices, and 

• Z is a d-dimensional Brownian motion, 

• 6^ an R"^ -valued locally bounded process, 

• X a cddldg process taking values in S'^ and 

• ijX {d^, dt) the random measure associated with the jumps ofY, whose com- 
pensator is given by Kt{d^)dt, where Kt{dS^) = Ktiw, dS,) is for each (w, t) 
a measure on . 

Remark 3.1. (i) Usually, e.g., in [5], the assumption of an Ito-semimartingale 

is formulated in terms of the Gringelions representation, which means that 
there exists an extension of the probability space, on which are defined a 
d'-dimensional Brownian motion W and a Poisson random measure p with 
Levy measure A such that 

Yt^y+ f blds+ f a,dWs+ f f Si^ s-)l[^^s\\<iMd^,ds) - Xid^ds) 
Jo Jo Jo Jr'^ 



+ 



/ / Sits-)l{\\s\\>i}p{dtds), 
Jo Jw 



where at is an M'^^'* -valued predictable process such that a^a = X and S 
is a predictable E''-valued function on fl M+. In view of applications 

to (affine) processes, whose characteristics are given in terms of represen- 



tation (3.1), we prefer to use the formulation of Assumption (H) and do 
not use the Gringelions representation, since it involves an extension of 
the probability space and the specific form of 5 and p is not evident, 
(ii) The conditions on the characteristics in Assumption (H) correspond to 
those of [5j Assumption 4.4.2 (or (H))] with the only difference that for 
the moment we do not assume some kind of local integrability on the jump 
measure, as in (iii) of 5, Assumption 4.4.2]. 

Note that the assumption of an Ito-semimartingale is satisfied by all continuous- 
time models used in mathematical finance. Indeed, beside the assumption of abso- 
lutely continuous characteristics, this is the most general model-free setting which 
is in accordance with the no-arbitrage paradigm. This assumption is actually also 
the only one needed to prove consistency of the Fourier-Fejer instantaneous covari- 
ance estimator, denoted by Xt- However for establishing a central limit theorem 
we also need some structural assumptions on the instantaneous covariance process 
X: 
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Assumption (HI). Assumption (H) holds and the instantaneous covariance pro- 
cess X is an Ito-semimartingale of the form 

Xt=x+ [ bfds + yf QidB,.,+ I I ^fi''{d^,ds) 



where 



t>o 



• B is p-dimensional Brownian motion, which can be correlated with Z , 
the Brownian motion driving the log-price process, such that d{Zi,Bj)t — 
Pt^ijdt, where pij is adapted cagldd for all i G {1, . . . ,d} and j G {1, . . . ,p}, 

• b'^ is an W^^'^-valued locally bounded predictable process, 

• (Q"')jG{i....,p} ^•s adapted cagldd process taking values in Sd, 

• p^ is the random measure associated with the jumps of X , whose com- 
pensator is given by Ft{d^)dt, where Ft{d^) — Ft{uj,d^) is for each (w,t) 
a measure on Sd such that the process 

(3.2) (^^ {Vxt- + ^ - ^x;:)Ft{do^ 

is locally bounded. 

Moreover, both processes Xt and Xt^ take their values in Sj"*", the set of all 
(strictly) positive semidefinite d x d matrices. Furthermore, the drift process b^ 
of Y is additionally assumed to be adapted and cagldd. 

Remark 3.2. (i) Assumption [HI) corresponds essentially to [5l Assumption 

4.4.3 (or {K))] or jll Assumption [HI)], respectively. The main difference 
is that we require X to be an Ito-semimartingale, whereas in the above 
references this condition is explicitly stated for ^/X. Since we assume 
additionally that Xt and Xt- take values in S^^ , \fX is again an Ito- 
semimartingale. Local boundedness then also holds for the drift and for 
the compensator of the jumps of ^fX, where the latter is a consequence 



of condition (3.2 1. The motivation to state Assumption (i?l) in terms of 
X stems again from applications to 5'j'-valued affine processes, where the 
characteristics of \fX would have a much more complicated form than the 
simple affine dependence on X. 

(ii) Also in view of affine processes we prefer the formulation in terms of a 
Brownian motion B, correlated with Z instead of decomposing B into Z 
and another independent Brownian motion. 

(iii) Concerning the jump part we implicitly assume that the jumps of X are 
of finite variation, whence we can avoid the introduction of a truncation 
function. This is not restrictive in our case, since in the central limit 
theorem below continuity oit^Xt is needed. 

4. The role of pathwise covariance estimation in robust calibration 

In this article we combine jump robust estimators (already considered for inte- 
grated covariance estimation) with instantaneous covariance estimation based on 
Fourier methods, as introduced by Malliavin and Mancino [HIS]- More precisely, 
our method consists in conducting the following three steps: 

(1) the first step is to reconstruct non-parametrically from discrete observa- 
tions along an equidistant time grid Ai — {t^ < . . . < t^„xj ~ -^i ^^^^ ^^'^P 
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width ^ of the log-price process Y a discrete set of points on the trajec- 
tory of the instantaneous covariance process X along a coarser, equidistant 
time grid A2 C Ai: 

(^t)teAi ^ {Xt)teA2 ■ 

(2) using then the reconstructed path X along the coarser grid A2 allows 
in principle to repeat the first step, or at least to estimate integrated 
quantities of that discretely given trajectory, for instance to compute an 
estimator for the integrated covariance of X: 




iXt)teA2 

where q is defined by qij^'j' = J2k,i QijQi'j'- 
(3) under some parametric specification of X , e.g., being an affinc S'j^-valued 
process, certain parameters associated to the covariance of X and the 
correlation between X and Y can then be determined from the previously 
defined estimators. 

In order to perform the first step (1), that is, the non-parametric pathwise co- 
variance estimation, we rely on (a modification of) the Fourier method introduced 
in [SI H] . In order to describe its main idea, let us first introduce some notation: 
for an L-'^[0,T] function / we denote its Fourier coefficients for fc G Z by 

Him ^ ^ 1^ fit)e-''^''dt. 

The Fourier method is now best described by the following steps. Notice that - as 
outlined in Section [2] - we could perform these steps for any orthonormal system 
inL2([0,r]). 

(la) Recover from a discrete observation of Y an estimator for the Fourier co- 
efficients of the components of the path t — J> p{Xt{uj)) for some continuous 
invertible function p : Sd ^ Sd- In other words, find an estimator for 

Hp{X)m p{Xt)e-''^^'dt = (efc, p{X)) . 

(lb) Use Fourier-Fejer inversion to reconstruct the path of t ^ p{Xt)- In fact, 
by Fejer's theorem 

N 



L — _ /\r V / 



k=-N 

converges uniformly (and in L^) to t 1— > p{Xt) on [0,T] ii t ^ Xt is 
continuous. If X has cadlag paths, then the limit is given by Pt^^'^+^P^-^'-) , 
Due to central limit theorems on the fine grid Ai we make errors in the 
reconstruction of p{X) of size 

N 



E 



1 - \^] S.e'-^'^^ 



k=-N 

where £k is a sequence of error random variables, which are approximately 
conditionally Gaussian with variance of order 0(n~^). Hence it does not 
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make sense to use all Fourier coefficients from — [nTj to [nT\ , but tfiere 
will appear a subtle relationship between the sum of the errors, the size 
of TV with respect to n and the rate of the central limit theorem for the 
reconstruction. See also the remarks in Section [2| 
(Ic) Invert the function p to obtain an estimator X of the realized path of X. 
For the second step (2), we can rely on existing estimators for the realized inte- 
grated covariance, into which we plug the estimator X obtained in the first step. 

In the third step (3) we focus on particular parametric specifications of X. One 
particularly tractable class is the class of affine models. In this case the quadratic 
variation of X satisfies 

(XS, = 4a,, / a e S+ , 

Jo 

such that knowing an estimator for {X^^, Xjj)t and an estimator for 

(r/,r/)T= / Xs,^jds, 

Jo 

namely Xs.ijds allows to estimate the parameter aij. 

5. Estimating Fourier coefficients 

In this section we focus on step (la), i.e., on how the Fourier coefficients of 
t !—> p{Xt) can be estimated from discrete observations of Y. Realizing that the 
only difference with respect to estimators for integrated (functions of the) realized 
covariance are the terms 6"'^*=* in the integral for the Fourier coefficients, we can 
make use of (Fourier basis modified) jump robust estimators like 

• the power variation estimators considered by Barndorff-Nielsen et al. [T], 

• estimators for the realized Laplace transform of volatility introduced by 
Tauchen and Todorov [12] and 

• other jump robust specifications, as for example considered in O Theorem 
5.3.5] 

In order to introduce those estimators, let us start by making some assumptions 
on the observations of the log-price process Y. Throughout let T > be fixed and 
suppose that the time grids of observations for all components of Y in [0, T] are 
equal and equidistant, i.e., 

Tn, 

C = -, m = 0,...,lnT\. 

n 

The increments of a process Z with respect to the above time grid are denoted by 

Remark 5.1. If grids are non- equidistant and non-equal for different coordinates it 
might be wise to use estimators, whose input are more continuous quantities than 
increments, e.g., Fourier coefficients. This is outlined for instance in [51|S]. In any 
case our method will provide as a result continuous path functionals such as Fourier 
coefficients after the first estimation procedure. 

The estimators for the Fourier coefficients that we consider are of the form 

, VnT\ 

(5.1) F(r, g, ky^=-Y. e-'^'=*^^-g(Vr^A;^y), 

n ^ — ^ 

m—l 
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for some function g : M'^ — > Sd and we write 

V{Y, 5) (ViY, g, ~Ny^, . . . ,V{Y, g^O)^, . . . , g, iV) J)^. 

Note that the 0-th Fourier coefficient V{Y, g, 0)^ corresponds to estimators for inte- 
grated (functions of the) covariance, as the power variation estimators, considered 
by Barndorff-Nielsen et ah (see, e.g., [1]), but also the reahzed Laplace transform 
estimator, introduced by Tauchen and Todorov [H]. Indeed in these cases the 
function g is given by 

g-.R-^^Sd, ixi,...,XdV ^ {\x,\''\xj\'),,j^{i^,„^d}, r + .s>0 

and 

g -.W^ Sd, (xi,..., Xd)^ {cos{xi + Xj))i^j^{i^,,,^d} 

respectively. 

Our first aim is to study asymptotic properties of V{Y, g)^''^ , for which we rely 
to a large extent on the results of |T] , [3] and [S] . The following assumptions on the 
function g, needed to establish consistency and a central limit theorem, are also 
taken from [T]: 

Assumption (J). The function g is continuous with at most polynomial growth. 

Assumption (K). The function g is even and continuously differentiahle with par- 
tial derivatives having at most polynomial growth. 

Assumption (K'). The function g is even, with at most polynomial growth and 
outside a subset B of which is a finite union of affine hyperplanes. With 
d{x,B) denoting the distance between X eR"^ and B, we have for some w G (0, 1] 
and p > 



xeB^^\\Vgix)\\<Cil + \\x\n{l 

„^ n , d(x,B)^ 
x^B\ b||<(lA^^) 



d{x,BY- 



I V/(x + y) - V/(x)|| < C\\y\\{l + Hxf + hV) 1 



d{x,By 



Remark 5.2. The conditions of Assumption {K') are especially designed to accom- 
modate the functions 

{xi,...,Xd)^ M- {\xi\''\xj\'')ij(z{i^,„^d} 

for r + s < 1, which correspond to the jump robust power variation estimators. 

In the case when K is a pure diffusion process the results of [Tj carry directly 
over to the Fourier basis modified statistics V{Y,g,k)l^. In the case of jumps, 
the respective assertions of Theorem 3.4.1, Theorem 5.3.5 and Theorem 5.3.6] 
can also be directly transfered to V{Y,g,k)lj^. A sufficient condition which allows 
to incorporate jumps and which is also satisfied by the assumptions of the cited 
theorems, relates the function g with the jump activity of (a localized version of) 
Y and is stated in Assumption {L{ri)) below. Let us denote by the diffusion 
part of Y with respect to some truncation function x' i i-e., 

(5.2) Diixi = y + J^{^^ + l (^'(^) - x{0)Ksido^ ds + ^/x::dz,. 
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Then we shall require that the ii-norm of g(V^A;^y(p)) - gi^A'^^D^'^P^x')) 
goes sufficiently fast to uniformly in m, where Y(p) denotes a localized version of 
Y. 

Assumption (L{ri)). Let x' be a truncation function such that the modified drift 
ofY 



b''{x') = b^+ ix'{0-x{0)K{dO 



(5.3) 

is cdgldd. Moreover, suppose that there exists an increasing sequence of stopping 
times (Tp) with linip Tp — od a.s. and processes Y{p) such that for t < Tp 

(5.4) Y{p)t = Yt a.s and DJ {x!) = dI^^\x') a.s. 

For r/ > 0, we then have for all p 



lim sup n^W, 

"^°°l<m<LnTJ 



= 0. 



In Section |5.2[ we shall give precise examples of g for which this condition is 
satisfied. 

In the following section and throughout the paper, C always denotes some con- 
stant which can vary from line to line. 



5.1. Consistency and a central limit theorem for estimators of the Fourier 
coefficients. The following result for the statistics V{Y, g)^^ is a slight modifica- 
tion of p3 Theorem 2.1. and Theorem 2.3] when y is a diffusion and of 5, Theorem 
3.4.1, Theorem 5.3.5 and Theorem 5.3.6] in the general case. 

For its formulation we need some further notation: let / : [0, T] — > Sd be some 
L^{[Q,T\) function. Then we denote the (2 A'' -I- l)d x d dimensional "vector" of 
Fourier coefficients (the "coordinates" of Section [2]) by 

-^^(/) = {HI){-N). . . . ,-F(/)(0), . . ..HI){N)V ■ 

Moreover, for a function /i : M'^ — > M™ and a d-dimensional normally distributed 
random variable U with mean and covariance X, the first moment of h(U) is 
denoted by ph{X), i.e., 

Ph{X)=E[h{U)], f/^AA(0,X). 
By F{ph{X)){k) we then mean 

Hpu{X)){k) = }^ j\,,{Xt)e-''^^'dt. 

Theorem 5.3. Under Assumptions {H), (J) and (L(0)), we have 

V{Y,g)--'' ^TT^'ip.iX)). 

Moreover, under the assumption (HI) and (K) or{K') and{L{^)), theC'^^^^^'''^^'^- 
valued random variable 

Vn{v{Y,g)-^''~TT^ip,iX))) 
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converges for n —> cx) stably in law to an J- - conditional Gaussian random variable 
defined on an extension of the original probability space with mean and covariance 



where i,j, i' ,j' G {1, . . . , d} and k, k' G {—-/V, • • ■ , N}. 

Remark 5.4. (i) Stable convergence in law for a sequence of random variables 

([/„) to a limit U (defined on an extension of (fi, J^, P)) means that, for any 
bounded continuous function / and any bounded J^-measurable random 
variable V ^ we have 

lim E[F/(C/„)] -E[V^/([/)]. 

(ii) The above convergence results do not only hold for T fixed, but we have 

y(y,.g)-^^r^^(p,(x)) 

locally uniformly in T and also stable convergence process-wisej^ The 
latter means that 

V(r,.g)J^^~TJ-^(p,(X))) 
converges stably in law to a process U{g, N) given componentwise by 

2(2Af+l) d T 

(5.5) C/(g,7V)*^.^= E / ^%.r'r'^s.er 

k' = l z'j' = l-^0 

where 

2(2Ar+l) d „T 

EV^ / rfer rfc'r ^ (-<kk' 

I ^s.,lj,pq^s,i'j' ,pq ^ijl'j'- 
r=l p,q=l-^'^ 

Here, is a 2{2N + l)d x d-dimensional Brownian motion which is de- 
fined on an extension of the probability space (fi, J^, (J^t)f>o, P) and is 
independent of the cr-field J-. 

(iii) The above theorem has been proved in [1] in a pure diffusion setting and 
A; = 0. Inclusion of jumps has been considered (in the one-dimensional 
case) in j^j and |T3] for g = \x\^ and in [T2] for g = cos(a;). More general 
functions (also for the case k = 0) are treated in 0] and Theorem 3.4.1, 
Theorem 5.3.5 and Theorem 5.3.6]. 

(iv) In the examples g{x) = \x\^ and g{x) = cos(a;), the function Pg{x) corre- 
sponds to 

Pi:,^l:,in{x) = \xf^E[\Un,U^Af{0,l) 

and 

respectively. 

"'^Here, J-"(/)(fc) is defined for variable T. 
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Proof. We consider the one-dimensional case, i.e., g : M — > M, since the muhi- 
dimensional case follows from it immediately in a methodological sense. Moreover, 
by the so called localization procedure as described in T, Section 3] or Section 



4.4.1], we can strengthen assumption (H) on Y together with Condition (5.3) to 
(SH) (see [Tl Section 3]), that is, {x') defined in (5.3), and X are supposed to be 



uniformly bounded by a constant. Similarly, assumption (HI) on X is strengthened 
to (SHI), that is, (SH) holds and the differential characteristics of the drift and 



the diffusion part of vX and (3.2), that is 



( [ {^Xt^+C-^^)Ft{d0) 

\JSi / t>Q 



are bounded by a constant (compare |1, Assumption (SHI)] and [5^, Assumption 
4.4.7 (or (SK))]). 

Let us introduce the following notation 



nJXr 



and let us split 



.,1, 



TT^'ip.iX))) 

^s^(C_i)5(ynA;;y) - / B''is)p,{x^)ds 



LnTJ 



into 



where 



LnTJ 



n,3 



/ (B^^ (C-i)P™-i(5) - is)pAXs))ds 

^6^(C-i)(5(y^A;^r)-g(/3;;)). 



We divide the proof into several steps: the first step, which is subject of Propo- 
sition 5.5 below, consists in dealing with 

LnTJ L»TJ 

(5.6) U^{g,N) ^ ,7™'' ^77^^ ^^(C-i)(.9(/3"J - 

m— 1 7n— 1 



As proved in Proposition 5.5 below, iU"{g, N))nen converges stably in law to the 
process U{g,N), defined in (5.5). 
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For the central limit theorem the second step consists in showing that 

[nT] [nT] j.. 



m=l m=l -^C-l 



{s)pg{Xs))ds 



converges in probability to uniformly in T. This can be shown similarly as in [5l 
Section 5.3.3, B]. Note that for the consistency result it is enough that 



lnT\ 



LnTJ 



converges in probability to uniformly in T, which is implied by Riemann integra- 
bility. 

In the third step we finally consider 



LnTJ 



LnTJ 



m—l rn—1 

which we split into J2m=i Vm'^^ + Vm^^i '^ith 



„n,31 



1 



B^{t-,_,)ig{VnA-,D^x'))~9{P:^J), 



„n,32 



where (x') denotes the diffusion part of Y defined in (5.2). For the central limit 
theorem to hold true, X]m=i Vm^^ to converge to in probability uniformly in 
T. To this end, it suffices to prove 



(5.7) 



(5.8) 



LnTJ 

EE[h™'''l 



m—l 



LnTJ 







holds true uniformly in T. Indeed, (5.7 1 implies (compare [5j Lemma 2.2.11]) 



LnTJ 



m—l 







and thus together with (5.8 1, we have J2m=i Vm^^ ~^ (uniformly in T). Both 
requirements (5.71 and (5.8 1 are met under the conditions on X, Y and g and the 
proof of (5.7) can be found in [1] Proof of Theorem 5.1] and (5.8) is shown in [SI 
Section 5.3.3, C]. For the consistency result only (5.7) has to be satisfied, which 
holds under the less restrictive assumptions (H) and (J) (see |TJ Theorem 5.1]). 
Indeed, we only have to show that 

L«TJ 
m—l ^ 
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which follows from the Cauchy-Schwarz inequality and (5.7), since 

LnTJ 



LnTJ 



J2^E[\\r,-J^\\]<iTj2E[\\r,-jr] 



0. 



Finally, according to the assumptions (i(0)) or respectively, we have for 

all e > 



(5.9) 
and 
(5.10) 
where 



L«TJ 



.AT'" 



> £ 



LnTJ 



< P [Tp < T] 4 

< P [Tp < T] + P 



> e 





LnTJ 








> e 




m—l 







LnTJ 






E ^^^""(p) 


> e 




m— 1 





and Y{p) is defined in (5.4). Due to {L{0)) or (L(i)), respectively, the second term 
on the right hand side of (5.91 and ( |5.10 1 respectively tends to as n — cx) for all 
p. Since P [r^ < T] — > as p oo, we deduce 



LnTJ 



32 



> e 



and 



respectively, which completes the proof. 





LnTJ 






E 


> e 




m— 1 





□ 



The following proposition is an application of [U Proposition 4.1] or [S! Theorem 
4.2.1], whose proof we state for convenience. Again we here only consider the 
one-dimensional case, i.e., Y and X are one-dimensional and g : M — > M. 

Proposition 5.5. Assume that the process X is cadlag and bounded by a constant. 
Let g be a function of at most polynomial growth. Then the sequence of processes 
{Uf{g, N))neN defined in ( |5.6| is C-tight. Moreover, if g is even, then it converges 
stably in law to the process U{g, N) given componentwise for k e {—N, . . . , N} by 



2{2N+1 



(5.11) 

where 



U{g,N%= E / ^fdW^'^ 
k'=i ^0 



2(2Ar+l) 

= (pAXs) - (p,(X,))2)e-'%^('=-'=> 

and W is a 2{2N + 1) -dimensional Brownian motion which is defined on an ex- 
tension of the probability space [VL,T, (J^t)t>o,P) and is independent of the a-field 
T. 
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Proof. Let us write 



with 



LnTJ 



m=l 



Sm 'Im 



s^'(C-i)(5(^^)-p;^-i(5)). 



In particular, for k G {—N, . . . , N}, let 



LnTJ 
m=l 



Observe that 



(5.12) 



and 



E 



Hence we have, for k, I S {—N, . . . , N}, 



= ^cos{ -=^fcC-i ) COS ( ) iPl-iig') - {Pl-i{9)f), 



E[ReC(fc)ReC(OI-Ft._^ 
1 / 27r 
n V ^ 
E[ImC(fc)ImC(0|^C_J 

= - sm ( - — j sm 
E[ImC(A)ReC(OI-^C-J 



27r, 



27r 



iC-i)(p;;-i(5')-(p;j.-i(5)r). 
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Thus by Riemann integrability and the cadlag assumption on X, it follows that 
(5.13) 

LnTJ 

^E[ReC(fc)ReC(OI-^t;;_J 



m=0 



LnTJ 



cos I — — ks I cos 



^E[ImC(A;)ImC(OI-^C-J 



jn=0 



L«TJ 



5]E[ImC(fc)ReC(0|J-c_J 



27r, 
T 



ks sin 



27r 



2^, 



sm 



2n, 



2n, 



T 



- — fcs ) cos ( -—Is ) (pg2(X,) - {pg{Xs)r)ds, 



as n ^ CXI. Due to the polynomial growth of g we further obtain 

(5.14) E [iicri-F*;j,_J - ^ 



< 



which already implies tightness. If g is even, we have due to AJJ;^^ — —IS^Z 



(5.15) 



E 



= 0. 



Finally let N be any bounded martingale which is orthogonal to Z . Then we obtain 
by the same arguments is in [1, Proposition 4.1] that 



(5.16) 



E 



0. 



Due to ( |5l2| ), ( [5T3| , ( [5l4| ), ( [5T5| ) and ( |5T6| , [GJ Theorem IX.7.28] (see also 
Theorem 2.2.151 



) now implies that for fc, I G {—-/V, . . . , iV} 



Ref/^5(5,Af)(fc) 



Imf/^'(5,Ar)(fc) 



2(2Ar+l) rj. 



E 







2(2Af+l) 



, = 1 ^0 



where L — s means stable converges in law (see Remark 5.4|[(i) I and 



2(2Af+l) 

E ^ 



„Re Rc 



27r 

= cos — — k 



3 = 1 
2(2Af+l) 

E <fc,<y=sin 

2(2Af+l) 



2^, 
T 

2tt. 



ks sin 



-^;.)(p,.(x,)-(p,(x,))^), 



27r, 
T 



is]ipg.ix,)-ipgix,)y), 



27r. 
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and W is 2{2N + l)-dimensional Brownian motion defined on an extension of the 
probability space {n,J^,{Tt)t>0;P) and independent of the u-field Defining 



yields the desired assertion. 



□ 



5.2. Jump robust estimators. The aim of this section is to specify classes of 
functions g and conditions on the jumps such that condition {L{0)) or (L(^ )), 
respectively, is satisfied and such that the estimator V{Y,g,kyj. given in (5.1) is 
robust to jumps. The conditions on g are in line with the respective assumptions 
in [51 Theorem 3.4.1 (a), Theorem 5.3.5 (7) and Theorem 5.3.6, Equation 5.3.11]. 

Proposition 5.6. (i) Let g be continuous with g[x) — o(||a;^||) as \\x\\ — > 00. 

Moreover, suppose that 

(■T 



(5.17) 



(5.18) 



sup 



ii^^f 11 + 11^*11+ /(iieir Ai)ifi(rfe)M^<oo 



and supj^ (^[q || AX((a;)|| < 00. Then for all m (z {1, . . . , [ntj } 



hm E 



\g{VnAl^Y)-g{V^AZ,D^{id))\ 



0. 



(ii) Suppose that g satisfies for some q > and some < r < r' < 1 
(5.19) \\gix) - giy)\\ < C(l + \\y\m\x - yf + ||x - yf). 

Moreover, let j3 € [0, 2] and assume that for all t £ [0, T] 



(5.20) 
(5.21) 



E 



mfKtido 



?ii<i 



E 



«ll>i 



< 00, 



< 00 



(5.22) 



and that {xi{j3>i}) — + J (x(0l{/3>i}~x(C))-^('^C) 0,^ defined in (5.3) 
and X is uniformly bounded. 

(a) If (3 <r < 1, then there exists some n G (1, ^] such that 

E[\\g{V^Ai;,Y)-g{V^Al,D^{0)))\\]<C^^, 

for all m Cz {I, ■ ■ ■ , \jit\}. 

(b) If P>r, then 



E [\\g{^A^^Y) - g {V^A^^D^ {xl{py^)) ||] < C 



[nt\}. 



1 



2-/3 7 
jf 2/3 



for all m (z {1, 

Remark 5.7. (i) Another sufficient condition to obtain (5.18) is 



(5.23) 



E 



Y\2 



{hi) 



J eKt{do+xt 



< 00 



for all t € [0,T]. Note also that we can localize Y and consider processes 
{Y{p))p for which (5.17) or (5.23) is automatically satisfied (see e.g. 
Lemma 3.4.5]). Assertion (i) then holds true for Y{p), which implies that 
(L(0)) is satisfied without further conditions on the characteristics of Y. 
Similarly the boundedness assumption on ^^(xl{^>i}) and X in statement 
(ii) can also be obtained by localizing the original process. 
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(ii) Condition (5.19) is satisfied by many classes of functions, e.g., globally 
Holder continuous functions or functions which satisfy ||.g(2^)|| < C(l + 
11x11''') for r' e [0, 1) and ||Vg(a;)|| < C(l + for some q>Q (compare 

also [SJ Remark 5.3.7]). Let us also remark that, if (5.19) is satisfied with 
g = 0, K in (5.22) can be chosen to be 1. 

Proof. The first assertion is proved in [5l Lemma 3.4.6]. Concerning (ii), we shall 
distinguish the cases /3 < 1 and (3 > 1 and set without loss of generality xiO — 
^{\\^\\<i}^- Due to the assumption on g, we have 

E [WgiV^A^Y) - g {V^A^D'' {xl{p^i})) ||] 



< CE 



(5.24) 



C-i "'{ll?ll<i} 



(i + ||^A;^i?^(xi{^>i})||^)x 

a^^''{d^,ds)-^p:,,^K,{dOds) 



C-i "'{iieii<i} 



Applying Holder's inequality, we get for 1 < i, k < oo such that 7 + ^ = 1 
E[\\VnA-^D^{xl{p>i}WJ'] 

< E [\\VnA-^D^{xl{p>i})r] ' E [j''^] ^ . 

Here, J stands for each of the above expressions of the jumps and k corresponds 
to r or r' . Due to our assumptions on {x^{i3>i}) X and as a consequence of 
Holder's, Jensen's and Burkholder-Davis-Gundy's inequality the first expectation 
is bounded by a constant Cg.t for all q > and 1 < t < 00. 
Let us now consider for some a e (0, 1] 



E 



'C-i "'{iieii>i} 

Then since, for a e (0,1], || < E^ \\x 



e/i^(d^,ds) 



and due to (5.21 ) 



E 



C-i "'{ll?ll>i} 



< E 



= E 



C-i "'{ll?ll>i} 



\a''K,{dOds 



< C- 
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Consider now the term 
E ' " 



C-i "'{iieii<i} 



for /3 < 1. Then by Holder's inequahty and (|5.20|), we have 



E 



t".-i "'{II?II<1} 



E 



< E 



^ A.>^l{A,y<i} 

*m-l<S<tm 



E 



< C 



i„i-l<s<t„ 



C-l -'{ll«ll<i} 

1 



n f 



If /3 > 1, we obtain a similar estimate by using Holder and Burkholder-Davis- 



Gundy's inequality, the fact that f < 1 and ( |5.20p : 



E 



2 

.Y ( 



C-i "'{ll«ll<i} 

ill. 



< E 



< E 



< E 



< E 



af^^d^,ds)~K,{dOds) 

a^^''{d^,ds)-K,{dOds) 



uri^'idtds) 

C-i "'{ll«ll<i} / 
^ \\A,Y\f^^^Y<i} 

U\fKs{dOd. 



tm~l<S<t 



C-1 -'{ll«il<i} 



The last equality follows from the fact that for /? > 1, ^ < 1. Using these in- 
equalities and setting a equal to rn and r' k' , respectively, for some 1 < k < ^ and 
1 < k' < ^, we can estimate ( |5.24[ ) by 

E mV^A^^Y) - g {^nK.D-[xh,>.,)) \\] < ^ ^,,..^,XiM^)^^y 
If /3 < r, we can choose some k € (1, such that this expression is simplified to 
E [^g{,/^AU) - g {V^A^D'' {xl{py^)) ||] < C^-^ 
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and if /3 > r, we obtain 



E [\\g{V^A"^Y) - g (0iAr„i?^(xl{/3>i})) ||] < C— 



n 2/3 



□ 



Corollary 5.8. (i) Let g be continuous with g{x) ~ o(||a;^||) as ||a;| 

Then under assumption (H) we have 



(ii) Let g satisfy (5.19) with r G (0,1) and assume that (5.21) holds true. 
Moreover, let j3 € [0,1) and suppose that (5.20) is satisfied and that 



6^(0) 



is caglad. Then under the assumptions {HI) and (K) or {K') and ^zr^ < 
r < 1, the C^^^'^^'f'^^ -valued random variable 



:(n>^,g)-^-Tj-^(/,,(x))) 



converges for n — > oo stably in law to an T -conditional Gaussian random 
variable defined on an extension of the original probability space with mean 
and covariance 



iji'j' 



where i, j G {1, . . . , rf} and k, k' £ {—N, . . . , N}. 



Proof. Assertion (i) is a direct consequence of Theorem 5.3 Condition (L(0)), 
Proposition 5.6|[(i) and Remark 5.7||(i) 



Again, in view of Theorem |5.3[ the proof of the second statement consists in ver- 
ifying Condition (L(i)). Since (3 < 1, we can consider fe^(O) = b''^ — J xiO^i'^O 
which is - since it is a caglad process - locally bounded. By the localization 
procedure (see Lemma 4.4.8]) we can therefore consider processes {Y{p))p, for 
which b^'^'P\Q) and the diffusion characteristic X are uniformly bounded and (5.21 ) 
and ( 5.20 1 for /? < 1 are satisfied, as required in Proposition 5.6||(ii) 
This proposition then yields in the case 2^ < P < r 



(5.25) 



E 



< C- 



for some kg (1, and in the case ^zrj^ < r < (3 



E 



giV^A-^Yip)) - g [V^Al^D^'^^^Q) 



< C 



n 



By choosing 1 < k < < ^ in ( 5.25| ), (i(|)) is satisfied in both cases and the 
assertion follows. □ 
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Remark 5.9. (i) The specifications 

(5.26) g:R'' ~>Sd, (xi, . . . , x^)^ ^ 



9 ■ 

for r < 1 and 



,Xd 



Y ^ {cos{xi + Xj)) 



j))i,j&{i,...,d} 



are covered by these conditions. In the case of (5.26), the above corollary 
recovers [21 Theorem 1 (iii)], which has been proved for one dimensional 
jump diffusions where the jumps are described by a Levy process. For func- 
tions satisfying (5.19) a similar statement is proved in [5, Theorem 5.3.5 
(7) and Theorem 5.3.6], however, under slightly different conditions on the 
jump measures (in particular, supposing the Gringelions representation of 

.. . . . 

(ii) Another function which satisfies for example the above requirements and 
for which pg is invertible and easily computable is 



9 ■ 



Sd, {xi 



where Aij = CicJ + ejej 
vector. The function pg 



is given by 



^ and e, denotes the canonical basis 



\/ + 2Xij + Xjj + 1 



6. Asymptotic properties of estimators for (functions of) the 
instantaneous covariance process 

In this section we focus on step (lb) and in the sequel on step (Ic), that is, we are 
interested in establishing consistency and a central limit theorem for an estimator 
of Pg{Xt) and Xt respectively. The estimator for pg{Xt) is defined via Fourier-Fejer 
inversion using the above estimators for the Fourier coefficients: 

(6.1) (l-^)e'^'=V(y,.g,fc)^. 

Once we have obtained a consistency and a central limit theorem for this estimator, 
we can translate these results to an estimator for At, which we define via 

(6.2) X^'"" := p-g^ [pjcil'''^ 

provided that Pg(x) : Sd 'x Sd, x 1-^ Pgix) is invertible. 

Let us start with the following lemma which deals with convergence of Riemann 
sums for the Fejer kernel and which is crucial for studying asymptotic properties 
of the above estimators. 

Lemma 6.1. Let n,N E N and denote 



1 

= -TT + - < 

n 



\n2n\ 

= -7r+ i 

n 
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(6.3) 



Fn{x) 



1 siii((7V+l)f)' 
N + l Sim 



k=N 



E 1 



k=-N 



TV 



Moreover, let j (z R and suppose that lim = X /or some constant K > and 
let h be a 5-Hdlder continuous Junction. Then, we have 



(6.4) 

(6.5) 
(6.6) 



..N 



L»27rJ ^ 

lim ^ -Fjv(x;;_i) 

m— 1 

lim -'^ivl^m-i-' 



n,N^oo ^ — ' n 
m— 1 



TV 



[n27rj 



lim y 

m— 1 



lim 



lim 

N^oo 



FN{x)dx = 2tt, 



^dx-^ 
N 3 ' 



= lim 

N^oo 



h{x)dx 



Kv), 



ifj>l. Moreover, if ^ > 1, then 

[n2n] 



7n=l 



Fjs[{x)dx 



< c- 



N-r- 



L"27rJ 

^ -Ma;:^-i)i^^(x:;,_i) - / 

m=l "'-^ 



h{x)Fpf{x)dx 



< Cmax 



1 



Remark 6.2. 



(i) In the sequel we shall consider analogues of expressions ( 6.5 ) 



and (6.6) on the interval [0,r], that is, 



lim 

N~¥oo 



N 



dx — 



lim 



N 



h{x)dx 



2T 

T' 

2T 



which can be derived similarly as in the proof below, 
(ii) The expression Fn{x)'^/N appropriately normalized corresponds to the 
so-called Jackson kernel (see e.g. [71 Section 4.2]). 



Proof. We have convergence of the Riemann sums in (6.4 1 to the corresponding 
integral if the grid becomes finer within the zeros of Fn . Since the distance between 
2 zeros is -j^^ ^'^^ ^ zeros closest to the origin), this is the case if only if 

7 > 1. By the same argument the left hand side of (6.5) and (6.6| converges to the 
corresponding integrals if and only if 7 > 1. The assertion then follows from the 
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following calculation 



N 



k,l 



N 
N 



N 
N 



„ 2Ar2 + l 47r 

Concerning the second statement, we have due to the mean value theorem 

[n27rj 



1 J— TT 

n27rj 

^ -(F,v(a;;^_i)-F;v(C-i)) 

m— 1 



m— 1 

2 

^ E 

(iV + l) 



E 



|i^Ar(x;^_i)-FA,(C-i)| 



27rfc 22r(fc + l)l 



with some G [^^m-n a^JJ^]- Using again the mean value theorem, we can further 

estimate 

1 



E 



n 



N+l ' N+1 

^ E 

27rn 1 



\FM-i)\ 



< max |F^(r?)| 

'/tliV + 1' iV + 1 



27rfc 2jr(fc + l)i ' ' TV + 1 71^ 



Since 



and since 



max \F'^{v)\^<C max \Fn{x)\ , 

'/tLjv_|_i, jv_|_i J -^tLj,.,.!, J 



max |F;v(a;)| ^ < C 

^^r 2^fc 2^(fc+l) , ^ ^' N 



2^(fc + l) 
iV + 1 



27rfc 
Af+l 



FN{x)dx, 



it follows that 



2^r(fc + l) 



max 

„r! 27rfc 27r(fc+ 
'I'^ljV + l' N+1 



< CiV2 / FN{x)dx. 
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Hence, we have 



E E 



|Fjv(x:^_i)-i^Ar(y:;,_i)| 



+ 1 ' N + 1 



< 



27m 1 



E 



max 

p,22Tk_ 27r(fc + l) i 



< 



27rn 1 



1 -^-^ / 2,rfc 

- _ (W + l) N + 1 

~ 2 



< ^ 2™ 1 



iV + 1 n 



~ n n iVT-i 

which yields the assertion. Concerning 

Ln27rJ 



m— 1 

we can estimate it by 

L"27rJ ^ 



h{x)Fiq{x)dx 



E / {h{x:^^,) - h{x))Fr,{x)dx 



Lri27rJ 



< max|/i(a;'^_i)| ^ 



F]\i{x)dx 



c m 

1 



.5 piT 



Fpf{x)dx 



< C 

< Cmax 



1 



1 1 



□ 



6.1. Consistency. Using the above lemma we can now proceed to establish con- 
sistency of (6.1|. 

Theorem 6.3. Let 7 > 1 and suppose that lim — K for some constant K > 0. 
Under the assumptions [H), (J) and (i(0)) we have for every t G [0,r] 
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as n, N oo. Convergence is uniform in t, if 1 1-^ Xt is continuous. 



Proof. Similar as in jl] or Section 4.4.1] and the proof of Theorem 5.3 



we 



strengthen assumption {H) together with Condition (5.31 to boundedness oflr (x') 
and X. 

/ — - — -11, N 

Let us decompose I pg{X)^ 



1 ^ 



k = -N 



1 - 



\k\ 

N 



into 

P,{Xt_)+Pg{Xt) 



< 



T 



N 



E (1 

k=-N 
N 

k=-N 



N 
N 



e^^' {V{Y,g,k)^~TT{p,{X)){k)) 



e'^^V(p,(X))(fc) 



p.iXt-) + PgiXt) 



By Fejer's theorem the second term converges almost surely to 0. If t Xt 
is continuous, then convergence holds uniformly in t. For the first term, let us 
decompose Ek=-N (l " §) e^"^^\V {Y, g,ky^ - TF{pg(X))(k)) into 



N 



E 1 



k=-N 



N 



e'^'' {V{Y,g,ky^-TF{p,{X)){k)) 



LnTJ 



m— 1 

LnTJ 



E (^(^-C-i)) (ff(/3;^)-p:;-i(<?)) 

m,= l ^ ' 

m=l \ / Jo 



where 



fe=-JV ^ ^ 



Since t — >■ Pg{Xt) is cadlag and bounded (recall the boundedness condition on X 
and the fact that g has at most polynomial growth) and 7 > 1, the last term in the 
above sum converges almost surely to 0, uniformly in t, by Riemann integrability. 

As a consequence of the proof of Theorem 6.6 below, the second term converges 
to in probability, uniformly in i, under the assumptions {H) and (J). 

Finally we have to focus on the first term, which we decompose into 

L"^J I/O- \ 

FM[-{t- C-i) {gWnKJ) - giV^A-^D^ix'))) 



lnT\ 



^ — ^ n 



2tt 



V ( ^(t - C-i) ) (.9(V"A« i?"^ (x')) - giPD). 
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where {x') is defined in (5.2). The second term converges in probability to 0, 
uniformly in t, since it can be estimated by 

supE [UV^A^^D'-iy)) - gia] E - C-i) 



and we have sup„ E [\\g{yi^A^D^ {x')) — (see the proof of Lemma 

5.3, Lemma 5.4]). Writing C = {g{^A^^Y) ^ gi^A^.D' {x!))) and C(p) = 
((7(-ynA"jF(p)) — (7(ynA"j£'^(p)(x'))), where is defined in (5.4), we have 



(6.7) P 



LnTJ 



< P [T-p < T] 



(i-C-i) C 



L«TJ 



> e 



E^^^(^(^-c-i))c(p) 



> e 



Since P [rp < T] ^ as p ^ (X3 and since E [|| Y.tH {^{t - C-i)) C(p)ll 
can be estimated by 

supE[|ic(p)ll]E-^^hr(^-C-i) , 

m— 1 ^ ' 

which converges to for allp due to Assumption (L(0)), (6.7| tends to as well. □ 

The following corollary states explicit conditions on g and the jumps of Y such 
that (i(0)) is satisfied and relies on Proposition 5.6|[(i) above. 

Corollary 6.4. Let g he continuous with g(x) — o(||a;^||) as \\x\\ — > oo. Let 7 > 1 
and suppose that lim = K for some constant K > Q. Then under assumption 
(H) we have for every t € [0, T] 

Pg(X)t 

as n, N ^ 00. Convergence is uniform in t, if 1 1-^ Xt is continuous. 



Proof. The proof is a consequence of Theorem 6.3 and Proposition 5.6 (i) (as in 
Corollary 5.8|[(i) above). □ 

As mentioned in the introduction of this section, we can now transfer the con- 
sistency result to instantaneous covariance estimator (6.2 1. 

Corollary 6.5. Let g he such that Pg{x) : Sd x Sd, x i— > Pg{x) has a continuous 
inverse. Then under the assumptions of Theorem 6.3 or Corollary \6.4\ we have for 
every t G [0, T] 

i>n,N P -1 / Pg{Xt-) + Pg{Xt) 
-^Pg 2 

as n, N ^ 00. If t i-^ Xt is continuous, then 

..N P 



X 



Xt 



uniformly in t. 



Proof. This corollary is simply a consequence of the continuous mapping theorem. 

□ 
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6.2. Central limit theorem. This section is dedicated to the proof of the central 
limit theorem for (functions of) the instantaneous covariance. Here we need to 
assume that the trajectories of X are almost surely Holder continuous with some 
exponent 5, which thus strengthens assumption (-ff 1) and excludes covariance pro- 
cesses exhibiting jumps. 

Theorem 6.6. Assume that the paths of X are almost surely Holder continuous 
with exponent 5. Let 1 < 7 < 2(5+1 and suppose that lim — K for some constant 
K > 0. Then under (HI) and (K) or {K') and {L{rj)) with rj > the random 

variable 

(6.8) n—i^pg{X\ ~Pg{Xt) 

converges for each t e [0, T] as n, N ^ 00 stably in law to an T -conditional Gauss- 
ian random variable defined on an extension of the original probability space with 
mean and finite non-zero covariance function given by 

61 K -I 

Remark 6.7. As already mentioned at the beginning of the article, it is remarkable 
that the asymptotic variance of the Fourier-Fejer estimator is smaller that the one 
of the classical local realized variance estimator. For simplicity, let us consider 
the one-dimensional case with g{y) = . Then the variance of the Fourier-Fejer 
estimator equals 

(6.9) 



under the above assumptions. In comparison, consider the classical (non-truncated) 
local realized variance estimator given by 

X, =yef{t)y(AZ,Yfef{t^_,) 



j — 1 m— 1 



with 



and tf ,,.l,...,Ar. 



Similar as in the above theorem, suppose 1 < 7 < 2 and lim — K for some 
constant K > Q. Then, according to j5J Theorem 13.3.3 b)] 



-n,Af 



n— [ X^ -Xi 



converges for each t E [0, T] as n, N ^ 00 stably in law to an J^-conditional Gaussian 
random variable with mean and covariance function given by 

2X1 



and is therefore | times bigger than (6.9). Let us remark that in the notation of 



Theorem 13.3.3 b)], fc„ corresponds to that is, the number of points in the 
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e (0, i) and /?' = K-rT such that the above assertion 



interval [t^_^,tf], r ^ ^-^ . ^ 
concerning the classical estimator is implied by ^5, Theorem 13.3.3 a)] with /3 = 
since t < i. 



Proof. Similarly as in the proof of Theorem 5.3 we strengthen the assumption (i? 1) 
to {SHI), that is, b^{x'), defined in (5.3), X and the differential characteristics 
of the drift and the diffusion part of y/ X and (3.2) are bounded by a constant. 
Analogously to the proof of Theorem |6.3| we now decompose 



n 27 



.N 



p,{X\ ~p,iX) 



into 



\ k=-i 



'-kt 



V{Y,g,krr\-Pg{Xt) 



7-1 

< n 2 



7-1 

+ n 



\k=-N ^ 
N , 



N 



e'^'' (F(r,5,fc)?-TJ-(p,(X))(fc))) + 



N 



e'^'^':F{pg{X)){k)-pg{X,) 



Due to the Holder continuity assumption on t t-^ Xt, the second term can be 
estimated by Cn^^ (compare [(T, Eq. 13]). Since 5 > by assumption, the 
second term converges almost surely to 0, uniformly in t. 

The first term is decomposed as before into 



-1 1 ^ 
— [ 1 



T 



k=-N 



\k\ 

N 




^.^kt (^v{Y,g,k)^-TT{p,{X)){k)) 



N 



{t- s)^ Pg{Xs)ds 



Due to Lemma 6.1 the last term of this sum can be estimated by 

= c( 



7-1 

Cn '^-1 max 



n \ — 7^ 

N 2 max 



1 1 

7V7-1 ' 



which converges to 0, since < min(7 — 1, (57) as a consequence of the assumption 



<5> V- 



FOURIER TRANSFORM METHODS FOR PATHWISE COVARIANCE ESTIMATION 33 



Let US now turn to the second term 

LnTJ 

T 



1 2^ 

—n 2t 

T — n 

m— 1 



E -^^ (^(i-c-i)) (5(/9rj-pr„-i(5)) 



L«TJ 



m=l 



where 



Since E 



..AT 



1 1 



0, we also have 



LnTJ 

lim ElEfc^l-^*;; 

n.N^oo ^ — ^ L 
m—1 



= 0. 



Moreover 
E 



1 



y2 

Thus we have 

LnTJ 



n 1 



(^-C-i) 



m,ij m.i' j' 



L«TJ 1 /27r \ 

m=l 77. ^ \ / 
;^(Pm-l(ffyff^'J') - Prn-l(ff^i)Pm-l(ff^'i')) T ' 

Due to Lemma |6.1| and Remark |6.2| the Umit of this expression is given by 
1 



V^'' ' : = Km 



T p2 (22i(. _ \\ 



w^oo T'^K~ Jo 

In view of Theorem [6l Theorem IX. 7. 28] it remains to verify that 

LnTJ 

(6-10) E ^ [ll^"«'^ll'i{ii^;i'"ii>e}l-^c-.] " 

7n— 1 

for all e > 0. By the Cauchy-Schwarz inequality we have 



N 







E 



7n,Nl\2 



{|2"ri>£} 



< WE 



,A| 



E 



{|Z,"-"l>e} 
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By definition of Z^!^^ and the polynomial growth of g, we can further estimate 



E 



IE 



rp4 



\9WI^)-P':n-1{9)\\ 



2tt 

2(7 + 1) N I 



_ P4 

A 



<c 



1 1 





7^2 (7+1) \ T 



2tt 



(i-C-i) 



Taking again the polynomial growth of g into account, there exists some p > such 
that 

hWl) ~ P™-i(.9)II < Cil + ||{/;;,_ir), P-a.s. , 



where U;i^_^ = ^JXt^^_^^l^Z ^ N{Q,Xt^^_J. Thus 



E 



'{||z,"-"||>c} 



\\U--A\ > ( ^ (n'^F^^ [fit - C-i) ) ^ - 1 ) ) ' l-^C-i 



>oo as 71,^"— foo 



Since this tends to 0, we can estimate (6.101 by 

L«TJ 

'y2 (7+17 -^ 



m=i - n 



2^ T2 „ 



n \ n J 



N 



0, 



where convergence to follows from Lemma |6.1| and the above estimate for 

\Z:f\\>e\^r:' 



hence Equation (6.10) is verified. Moreover, similarly as in the proof of Proposi- 
tion [STsl we have 



and 



E 



E 



= 



= 



for any bounded martingale M which is orthogonal to the Brownian motion Z. 
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-1-1 1 

1 — ' n 

m—l 



Finally we have to consider 

-Fat ( 

L«TJ 

. T 

L«TJ 

T 



m=l ^ ^ 



T ^ — ' n 

m— 1 



1 ^ 1/9 \ 



T ^ — ' n 

m— 1 



In view of Lemma |5, Lemma 2.2.11] it is sufficient to prove that 



.11) 

.12) 



2^2 

[nTl 



m—l 



1 ^e[c/-1|J-,._^ 



m—l 



0, 



and 



.13) 



where 





LnTJ 






E 


> e 




m—l 





(3(^A;j,i?^(x'))-3(/3.:;)), 

Let us first focus on U^i^. By 1', Lemma 5.3 and Lemma 5.4]) and the Holder 
continuity of X we have 



supE 



5(y^A:;i?^(x'))-5(0)| 







and we can therefore estimate (6.11 1 by 
supE 



21 1 \ - 1 



giVnA^;,D{x') - g{K))\\ \ ^ E ( 7^(i-C-i) ) , 



^1 n T 



27r 



T 



which converges to due to Lemma |6.1[ 

Concerning ( 6.12[ ), it is possible to decompose 



g{Vr^A-^D''{x')) ~ g^ ^Al^ + B^ 
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where for all 



E 



Section 5.3.3, C]. Then 

T-l 1 V-^ 1 ^ 

n^-< -Fn 

m—l 



= and E[||S;\||] < ^ with V > ^ (see 



27r 



-Fn 

m— 1 



<n^supE[||S;;,| 



27r 

7^ 



(i-C-i) E 



5" I J^f. 



1 1 

m—l 



N 



27r 



(i-C-i) 



converges due to Lemma 6.1 and thus yields (6.12). Condition (6.131 follows from 
the assumption L(r\) for r\ > and a similar estimate as in (6.7 1. The assertion 
is now implied by all these estimates and [6) Theorem IX. 7. 28]. □ 

Remark 6.8. (i) In the above theorem the assumption that the trajectories 

of X are J-Holder continuous is crucial. Typically this can be verified by 
using Kolmogorov's criterium, which states that the Holder exponent S 
satisfies 5 < - if 

a 

E[\\Xt-X.s\n<C\t-s\^+'. 
In our case we have for a > 2 
p 



E 



<C E 



<C E 



b^du 



E 



QidBu,: 



P 

E 



E 



WQirdu 



<C|t-sr + C^E 

<C\t - s\" + C\t - sl'i 
<C|t-s|t 

provided that E ||"] < oo and E < oo. For the last inequality 

we used the fact that we are working on [0, T]. If these moments exist for 
all a then we have Holder continuity with 6 < This is for example 
satisfied for affine diffusion processes on S'^ . 

(ii) If we have Holder continuity for all S < ^, then 7 € (1,2). The higher 7 
the better the convergence rate and it lies between (0, |). 

(iii) Condition {L{r])) with 77 > is satisfied, if the assumptions of Proposi- 
tion 



5.6 (u) 



hold with r > ^ 2=;g and /3 < 1 



27-1 ■ 
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Similarly as above for the consistency statement we now translate the central 
limit theorem to X"'^ defined in (6.2). 

Corollary 6.9. Let g be such that Pg{x) : Sd ^ Sd, x t-^ Pgi^) has a dijjerentiable 
inverse. Then under the assumptions of Theorem\6.i 



converges as n, N ^ oo for each t G [0, T] stably in law to a J^- conditional Gaussian 
random variable given by 

Mt = iVpgiXt)y'Nt, 
where Nt denotes the limit of ( |6.8[ ) . In particular the covariance of Mt is given by 

Uff = ^ (Vp,(XO),:^.L(Vp,(X,)),:;j,,,,K'='^'''. 

k,Lk',l' 

Proof. This assertion follows from the concept of stable convergence and is known 
as generalized A- method (see [HI Theorem 1.10]). □ 

7. Covariance of covariance estimation 

Having reconstructed the path of the instantaneous covariance, we can now pro- 
ceed with step (3), that is, the estimation of functions of the integrated covariance. 
To this end we plug the reconstructed path of the instantaneous covariance process 
into jump robust estimators as introduced in Section [5j 

[niT] 

y(X"'^,/,0)?? := - V fiV^A^X"-^), 

whe re / : Sd M'*^^''^ satisfies certain properties which are specified in Theo- 
below. In order to formulate the following theorem, we denote by the 



7.1 



diffusion part of X, that is, 



Jo .^^ Jo 



and by Y we mean 



Yr=V+l bYd-s+ f JxpdZ,+ f f xmi^'^{d^,d.s)-KsidOds) 
Jo Jo Jo Jm.'^ 

+ f f ^l^^{d^,ds). 

Jo Jr'' 



Moreover, 



and 



1 ^ 



— ~^n.N ( n.N 

— n ( YD 



Theorem 7.1. • Assume that X has jumps are of finite activity and sup- 

pose that the paths of X^ are almost surely Holder continuous with expo- 
nent 6. 
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• Let 1 < 7 < 2(5 + 1 and suppose that lim = K for some constant K > 0. 

• Let the conditions (HI) and (K) or {K') and L{ri) with rj > ^^^^ he in 
force and suppose that the instantaneous covariance process q of X defined 
by qs,zji'j' = Y.k,iQt,ijQ\,i'j' satisfies [HI). 

• Let g be such that x i— )■ (Vpg(a;))~^ exists and is continuous. 

• Suppose that f : Sd ^ satisfies either (K) or (K') and is addition- 
ally globally a-Holder continuous for a e (0, 1). 

• Let L < ^^^T-^ and assume that lim ^ — li. for some constant K > Q. 
Then 



Pf{qs)ds 



converges as n, N,m ^ oo stably in law to a J- -conditional Gaussian random vari- 
able defined on an extension of the original probability space with mean and co- 
variance function given by 

i-T 



F, 



ijkli'j' k'l' 



1 

k Jo 

Proof. Let us decompose 
(7.1) 

V^lv{r^^^,f,o) 



Pf{qs)ds = /, 0)?? - V{X^, /, 0)?,') 



+ 



+ V^lv{x^,f,oy, 



PfiQs)ds) 



In view [5l Theorem 5.3.5 and 5.3.6] and the relation between m and n, the second 
term converges to the stated Gaussian random variable. Hence we only have to 
prove that the first term converges to in probability. 



Decomposing X'^'^ into 



■ N 



..N 



the first term of the right hand side of (7.1) can be estimated due to the assumptions 
on / by 



ViX"-'\f,0)^-ViX^J,0)^ 



< m 2 



p=i 

[mT\ 



7n vD') 



A^iXD) 



..N 



< m 2 2Csup 
p 



iX^t 



X: 



m 2 2C sup 
p 



X-f-{XD)^ 



..N 



By Lemma |7.2| and |7.3| below, the relation between m and n and the condition on 
t, both summands converge to in probability, which proves the theorem. □ 
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Lemma 7.2. Let the conditions of Theorem \6.6\ be in force and let g be such that 
X H- (V Pg{x))~^ exists and is continuous. Moreover, consider a sequence {tp)p in 
[0, T] and let a e (0, 1]. Then for every 6 < 



27 

n sup 1 1 X,' -XtJ 



0. 



Proof. By localizing we can assume that X is uniformly bounded. In fact, consider 
a localizing sequence 

Tfc-inf{t>0||lXt|l >fc}, keN, 

and the processes 

rt rtATk rt /• 

Y{k)t = y+ bUs+ ^sdZ,+ / xmi^^{di,ds)^K,{d(,)ds) 

Jo Jo Jo Jm'^ 

i^-xiO)l^'^idC,ds), 

i 

X{k)t ^ Xtl{t<rt,}, 

where {Xt{k))t>o is uniformly bounded by definition. Moreover, define 



Pa{X{k))t 



1 ^ / 
r Y. 1 



T 



and 



N 



e'^''ViYik),g,k)^ 



X{k\' ■.= p-^[pg{X(k))^' 
Then the left hand side of 



n"'" supljX 
p 



n,N 



Xt. 



> e 



< P [Tfc < T] 



^"''sup||X(fc), -X{k)tjr>e 



tends to 0, if the second term on the right hand side does. Therefore, we can assume 
uniform boundedness of X. 

Furthermore, by the continuous mapping theorem it suffices to prove 



6 II vn.N -ir 

n sup II X' -Xt^ 



0. 



By the mean value theorem we obtain the identity 



(7.2) n^upllX,': 
p 



Xt 



n sup 

p 



where Cj^'™ is a random variable satisfying ||Ct"'^ ^ ^tpW < 11-''^*"'^^ ~ 
to the continuity assumption on x i-> iV pg{x))^^ and boundedness of X, (7.2) 
converges to in probability if 

n^up||p;(X)"''^-p,(X,J|| ^0. 
p 

An inspection of the proof of Theorem |6.6| reveals that this is the case for all 



Due 



< 



7-1 
27 



□ 
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Lemma 7.3. Assume that X has jumps of of finite activity and suppose that the 
paths of are almost surely Holder continuous with exponent 5 . Let 1 < 7 < 2(5+1 
and suppose that lim — K for some constant K > Q. Assume that the conditions 
(HI) and (K) or {K') and L{rj) with rj > are in force and g be such that 

X I— >■ (Vpg(a;))^^ exists and is continuous. Consider a sequence {tp)p in [0,T] and 
let a G (0, 1] . Then for every 6 < 



n sup 



0. 



Proof. By arguing similarly as in Lemma 7.2 above, we can assume that X, X^ 
and b^{x') are uniformly bounded by a constant, where b^{x') is defined in (5.3 1. 
The mean value theorem yields 



da M vn.N 

n""- supllX,' 



n sup 

p 



where Ct"'^ is a random variable satisfying — || < WX"^^ — {X^)^ 

Due to the continuity assumption on x (Vpg(x))~^ and boundedness of X, (7.2) 
converges to in probability if 



n sup 

p 

.N 



. -n.N n.N 

p,iX). -p,{X^\ 



0. 



By definition of Pg{X) and abbreviating F/v {^{tp — ^m-i)) by F^, 



\\p,{x\' -pM^'V II' 



can be written as 
(7.3) 



sup 

p 



L«TJ 



m— 1 



L«TJ 



< sup ^ ( -F^ ) g(V^Ar„i?^(x')) - giV^KaD""" {^)) " l{As.y#As.y-} 

P m=l V" ^ 

+ sup E \\g{V^KJ) ~ 5(y^Ar„i?^(x'))ir l{A;^y^As.y-} 



LnTJ 



p 1 \" / 



l{A;;,y^A;^yJ^}- 



By the boundedness of {x'), X and and the polynomial growth of g, 



E 



iVnA:,D^{x'))~giVnA-,D^ (x')) 
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is bounded by a constant. Therefore and due to the finite activity of the jumps of 
X (note that A^^^F 7^ KJ^ only if X jumps) 



E 



LnTJ 



m— 1 



is bounded as well. This and the form of the Fejer kernel implies that the first 



sumniand in (7.3) multiplied with n can be estimated by 

n 



Cr. 



Jn""n -I . 



which converges to due to the condition on 9. As a consequence of the 
remaining two summands in (7.3) multiplied with n^" converge even faster to and 
hence the assertion follows. □ 



8. Simulation results 

In this section we illustrate our theoretical results in the case of a multivariate 
affine model, where both the log-price Y and the instantaneous covariance process 
X can jump. More precisely, we consider a multivariate Bates-type model of the 
form 

Yt^y+ f hsds+ [ ^/x;ZdZ,+ f f 
Jo Jo Jo Js.'' 



Xt=x + [ {h + MXt+ XtM'^)dt+ ^/XtdBtT. 
Jo 

+ [ [ ifi''{d(,ds), 
Jo Jst 



EdBj^ 



'Xt 



where 



• Z is a d-dimensional Brownian motion correlated with the d x d matrix of 
Brownian motions B such that Z = — pW + Bp, where p G [—1,1]^ 
such that p^ p <1 and is a d-dimensional Brownian motion independent 
of B, 

• 11^ {d^, dt) is the random measure associated with the jumps of Y, whose 
compensator is given by J2i=i ^^F'^'{d£,i)dt, where A^' > and i^^* de- 
notes the Gaussian density with mean pi and standard deviation (7^, 

• p^ {d£,, dt) is the random measure associated with the jumps of X, whose 
compensator is given by X-^^^ F^^^{d£,ii)dt, where A'^^^ > and F^'^'^ 
denotes the density of the exponential distribution with parameter 

• the drift of Y is given by bg^i = —^Xgji — X^^{p^' 

• the parameters of X satisfy M G , ^ , u ^ 

&-(d-l)s2G5+. 

Note that the truncation function of Y is here chosen to be 0. 

As described in Section [T] and Section |4] we aim to recover the instantaneous 
covariance process X and the parameters a := and p from observations of Y. 
In order to be in accordance with market specifications, we simulate Y and X on 



-iX,,,, -A*''(e'^--4< -1) and 
R'^'"^, Y.e S+beS+ such that 



We here only suppose that Xn can jump. 
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n — 127750 — 511 * 250 grid points, which corresponds to 1 year {T = 1) of 1- 
minute data. For our numerical simulation, we consider the case d = 2 and use the 
following parameter values: 



-^^0,11 
-^^0.12 



-^0,12 
-^0,22 



M 



E2 



(M1,M2) 



(0,0) 



0.09 
-0.036 



-0.036 
0.09 



-1.6 -0.2 

-0.4 -1 

0.0725 om 

0.06 0.1325 



3.5a 



(-0.3,-0.5) 



(100, 100) 



(-0.005,-0.003) 



(0.015,0.02) 



10 



0^05 

In order to illustrate in particular that our estimator is robust to small and 
frequent jumps, the jump intensity of both log-prices is chosen to be quite high. 
Figure [2] and [3] below show simulated trajectories of the log-price and the instan- 
taneous covariance process, where the jumps are removed in the second graph in 
each case. 

A comparison between the reconstructed and simulated trajectories of the in- 
stantaneous covariance process is shown in Figure |4] These figures illustrate that - 
even in the case of (frequent) jumps in the log-price and in the variance (as it is the 
case for Xu) - the paths of X can be recovered very well. For the reconstruction 
of the trajectories of X we choose N = 210 Fourier coefficients, which corresponds 
to the choice 7 « 2 and K « 3, as specified in Theorem 6.6 This is a reasonable 



choice in view of an acceptable bias (in particular due to the fact that we do not 
sum all Fourier coefficients) and a rather small variance. Both, the simulated as 
well as the reconstructed trajectories are evaluated at 2N + 1 points. In our con- 



crete implementation the estimator for the Fourier coefficients (5.1 1 is based on the 
Tauchen-Todorov specification of the function 5, that is, 



9 



S2, (2/1, y2) ^^ {cOs{y^ + l{j^j}yj ))j je{l,2} • 



In this case 



£-3(2:11+23:12+2:22) 



'5(2:ii+2a:i2+a;22) 
^-52:22 



and X^''^ is obtained by 



.,N 
tAi 



2log[pgjX\ 



X. 



i,12 



.N 



*e {1,2}, 



21og p,.^.(X)^ 



..N 



X 



t,ll 



X. 



..N 
i,22 



FOURIER TRANSFORM METHODS FOR PATHWISE COVARIANCE ESTIMATION 43 



5 


Log-prices on n=127750 points 






Y1 

■■12 _ 


4 






3 






2 






g- 01 







-0 1 






-0 2 






-0 3 







time 

Log-prices without jumps on n=12775D points 




time 

Figure 2. Simulated log-price with and without jumps on n = 
127750 points 



The reconstructed trajectories of X are then used to estimate the parameters a 
and p. To this end, we use the power variation estimators, i.e., 

^ m 

^ m 
Tfi 

p=l 

where 
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Instantaneous covariance on n=127750 points 




Figure 3. Simulated instantaneous covariance with and without 
jumps on n = 127750 points 



These quantities are estimators for the power (co)variation of X and Y. Indeed we 
have under the assumptions of Theorem |7.1| 

»i 



25r 



r + 1 



23r 



r + 1 



- -P^42(-'^, an, 0:12,^22), 



r + l\ + 1 



i2^r 

TT 



2' 2' 2' I 



AauXs^zi 2(Vap)i 

2{y^p)iXs,ll Xs^u 



(4«..)S / XJ^ ds 



ds 



--: PCuiX,Y,a,p) 
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as m, n, iV — 7> oo. The formulas on the right hand sides follow from the expressions 
for the absolute moments of the bivariate Gaussian distribution (see, e.g., [TO]) and 
2F1 (a, b; c; x) denotes the Gaussian hypergeometric function. 
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The estimators for a and p can now be constructed via 



0)r - PVuiX"''', an, ai2, ^22))^ 



argmm 

ai2e[-\/3llS22.\/3ll322] 



P = ar, 



irgmin V F,, 0)^ - PQ,(X">^, F, S, p)) ' 



In our simulation study, we choose r = ^ in /r, 0)™ and r = 1 in 

V{X'lf Jr,^)^ for {ij) = (12) and {ij) = (22) respectively. This is due to the 
fact that X\\ exhibits jumps and taking a lower power reduces the contribution of 
jumps in the power variation. In y(Xj"'''^, F^, /^^s, 0)™, t and s are chosen to be 
i . Figure [s] and |6] show the estimated values for a and p as a function of the grid 
points TO. ^s a consequence of Theorem 7.1 the grid corresponding to to has to be 



coarsened considerably with respect to the initial gridding with n points (of order 
or even more depending on the power used). For this reason the number of grid 
points shown in the graphs is rather small. Nevertheless the estimation results are 
good approximations of the true parameter values and can further be improved by 
increasing n and thus in turn also m. 
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Figure 5. Estimation of a as a function of the grid points 
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Figure 6. Estimation of p as a function of the grid points 



